Support Vector X

/assets/images/svm-1.png|500

Basic terms

Support Vector

Margin

= the shortest distance between the hyperplane and the closest data points (support vectors)

Soft margin

= the margin that allows misclassifications. Support vector machine (SVM) uses a soft margin.

Support Vector Regression (SVR)

=> performs regression, continuous data

Kernels SVR = non-linear

/assets/images/svm-2.png|500

Support Vector Machine (SVM)

=> performs classification, discrete data

Support Vector Machines use Kernel Function to systematically find Support Vector Classier in higher dimension.

Kernels SVM

Why we need kernels

Kernel function is kind of a similarity measure. The inputs are original features and the output is a similarity measure in the new feature space.

Given that classification can be non-linear
=> mapping to a higher dimension (implicitly during the cost function optimization) can help:

assets/images/svm-3.png|500
=> however, the mapping can be highly compute-intensive
=> the Kernel trick can help with that:
assets/images/svm-4.png|500
More than one kernel can be good too:
/assets/images/svm-5.png|500

Types of Kernel Functions

Hyperparameters

Pros & Cons

Pros

Cons